1.  INTRODUCTION


1.1  Correlation  Statistics 


In  this  article  results  are  given  on  the  large-sample  behaviour 
of  normal  order  statistics.  These  are  then  used  to  demonstrate  that 
two  correlation  statistics  for  testing  normality,  introduced  at  about 
the  same  time  by  Shapiro  and  Francia  (1972)  and  by  De  Wet  and  Venter 
(1972),  have  the  same  asymptotic  distribution,  given  and  tabulated 
by  De  Wet  and  Venter  (1972). 

A  correlation  statistic  for  goodness-of-fit  is  one  based  on  the 
usual  correlation  coefficient  for  pairs  of  random  variables. 

The definition is extended to give a "correlation" between order statistics $X_{1n} < X_{2n} < \cdots < X_{nn}$ of a random sample of size n and a suitable set of constants $t_{in}$, i = 1,2,...,n. The correlation statistic $r_n(X,t)$ is then defined by

$$ r_n(X,t) = \frac{\sum (X_{in}-\bar{X})(t_{in}-\bar{t})}{\{\sum (X_{in}-\bar{X})^2 \, \sum (t_{in}-\bar{t})^2\}^{1/2}} , $$

where the sums are for i from 1 to n and where $\bar{X} = \sum X_{in}/n$ and $\bar{t} = \sum t_{in}/n$. The set $\{X_{in};\ i = 1,2,\ldots,n\}$ will be referred to as an order sample; it will also be referred to simply as X, or X', where X' is the row vector $(X_{1n}, X_{2n}, \ldots, X_{nn})$.
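As an illustration of the definition above, the following is a minimal Python sketch of $r_n(X,t)$. The function name `correlation_statistic` and the example values are ours and do not come from the source; the sample is sorted first so that it plays the role of the order sample.

```python
import math


def correlation_statistic(sample, constants):
    """Return r_n(X, t): the correlation between the ordered sample and the constants t_in."""
    xs = sorted(sample)  # order statistics X_1n <= ... <= X_nn
    n = len(xs)
    if n != len(constants) or n < 2:
        raise ValueError("sample and constants must have the same length n >= 2")
    x_bar = sum(xs) / n
    t_bar = sum(constants) / n
    numerator = sum((x - x_bar) * (t - t_bar) for x, t in zip(xs, constants))
    denominator = math.sqrt(
        sum((x - x_bar) ** 2 for x in xs) * sum((t - t_bar) ** 2 for t in constants)
    )
    return numerator / denominator


# Hypothetical data: an exactly linear relation between sample and constants.
r = correlation_statistic([3.0, 1.0, 2.0], [10.0, 20.0, 30.0])
```

Values of the statistic close to 1 indicate that the ordered sample is nearly a linear function of the chosen constants.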

Suppose the test of fit is a test of the null hypothesis $H_0$: X comes from the distribution $F(x;\theta)$, where θ is a vector of parameters. In many test situations, such as a test for normality, the components of θ are location and scale parameters, α and β respectively. Suppose then that the distribution is written $F(x;\alpha,\beta)$. Let $\{W_{in},\ i = 1,2,\ldots,n\}$ be the order statistics from a random sample of size n from $F(x;0,1)$, let E denote expectation, and let $m^*_{in} = E(W_{in})$ (the $m^*_{in}$'s will usually be known, or known with reasonable accuracy). We now note that $X_{in}$ can be written $\alpha + \beta W_{in}$; so if we assume that $W_{in}$ is close to its expectation $m^*_{in}$ (which will generally be true), we have the representation

$$ (1) \qquad X_{in} \approx \alpha + \beta m^*_{in} . $$


In other words, the $X_{in}$'s should be roughly linearly related to the $m^*_{in}$'s. To investigate $H_0$ with α and β unknown it is therefore natural to use a measure of how well the linear relation (1) holds.

By choosing $t_{in}$ to be $m^*_{in}$ we obtain an obvious test statistic $r(X,m^*)$. Another procedure for testing $H_0$ was introduced by Shapiro and Wilk (1965). The test statistic is based on a comparison of the estimate $\hat\beta$ of the slope in (1), obtained by generalized least-squares, with the estimate of β obtained from the sample variance.

1.2  Test  for  Normality

Consider the case where we wish to test whether the ordered sample X comes from a Normal distribution, with unknown mean μ and unknown variance $\sigma^2$, written $N(\mu,\sigma^2)$. Then the $W_{in}$ will be order statistics from N(0,1); let these be called $Z_{1n} < Z_{2n} < \cdots < Z_{nn}$, let $m_{in} = E(Z_{in})$, and let m be the column vector $(m_{1n}, m_{2n}, \ldots, m_{nn})'$.

Let $v_{ij}$ be the covariance between $Z_{in}$ and $Z_{jn}$, that is, $v_{ij} = E(Z_{in}-m_{in})(Z_{jn}-m_{jn})$, and let V be the matrix with entries $v_{ij}$, i,j = 1,...,n. We now have

$$ E(X_{in}) = \mu + \sigma m_{in} , $$

that is, α = μ and β = σ. The generalized least-squares estimate of σ is then

$$ \hat\sigma = m'V^{-1}X/(m'V^{-1}m) $$

and the test statistic proposed by Shapiro and Wilk is

$$ W = \hat\sigma^2 R^4/(S^2 C^2) $$

where $S^2 = \sum (X_{in}-\bar{X})^2$, $\bar{X} = \sum X_{in}/n$, and $R^2$ and $C^2$ are respectively the constants $m'V^{-1}m$ and $m'V^{-1}V^{-1}m$. The constants $R^2$ and $C^2$ are inserted to make $0 \le W \le 1$ and clearly $S^2/(n-1)$ is the sample variance estimate of $\sigma^2$.

The use of this statistic is limited by the need to tabulate the coefficient vector $a' = m'V^{-1}/C$; this was done by Shapiro and Wilk (1965), using both exact and approximate methods, for $n \le 50$. For larger values of n, Shapiro and Francia (1972) proposed the statistic

$$ W' = (m'X)^2/\{(m'm)S^2\} ; $$

this can be seen to be equal to the correlation statistic $r_n^2(X,m)$, that is, the squared simple correlation between X and m. The statistic W' may be regarded as a large sample version of W because of the fact that, for large n, we may write (nonrigorously) $V^{-1}m \approx 2m$ (Stephens, 1975); then $W \approx W'$ for large n. W' is simpler than W but is still difficult to calculate because of the need to evaluate the $m_{in}$; also no rigorous asymptotic theory exists for W or W'.
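The identity between W' and the squared correlation can be checked numerically. The sketch below is only an illustration: the score vector is a hypothetical symmetric set of constants summing to zero (the property of m that the identity relies on), not the true expected normal order statistics, and the function names are ours.

```python
import math


def shapiro_francia_form(sample, scores):
    """Compute (m'X)^2 / {(m'm) S^2} for an ordered sample and a score vector m."""
    xs = sorted(sample)
    n = len(xs)
    x_bar = sum(xs) / n
    s2 = sum((x - x_bar) ** 2 for x in xs)           # S^2, the corrected sum of squares
    m_dot_x = sum(m * x for m, x in zip(scores, xs))  # m'X
    m_dot_m = sum(m * m for m in scores)              # m'm
    return m_dot_x ** 2 / (m_dot_m * s2)


def squared_correlation(sample, scores):
    """Compute r_n^2(X, m) directly from the correlation definition."""
    xs = sorted(sample)
    n = len(xs)
    x_bar = sum(xs) / n
    m_bar = sum(scores) / n
    num = sum((x - x_bar) * (m - m_bar) for x, m in zip(xs, scores))
    den = sum((x - x_bar) ** 2 for x in xs) * sum((m - m_bar) ** 2 for m in scores)
    return num ** 2 / den


# Hypothetical symmetric scores (sum zero) and a hypothetical sample.
scores = [-1.2, -0.5, 0.0, 0.5, 1.2]
sample = [2.1, 0.3, 1.4, 3.8, 2.9]
w_prime = shapiro_francia_form(sample, scores)
r_squared = squared_correlation(sample, scores)
```

The two quantities agree because the scores sum to zero, so centring X and m does not change either the cross-product or the sum of squares of the scores.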

Various authors have suggested other values for $t_{in}$ to be used in $r_n^2(X,t)$; in particular, De Wet and Venter (1972) have proposed $t_{in} = H_{in} = \Phi^{-1}(i/(n+1))$ for the normal test, where the function $\Phi^{-1}(\cdot)$ is defined as follows. Let $\phi(u) = e^{-u^2/2}/(2\pi)^{1/2}$, and let

$$ \Phi(z) = \int_{-\infty}^{z} \phi(u)\,du ; $$

then if $y = \Phi(z)$, $z = \Phi^{-1}(y)$.
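The De Wet and Venter constants are straightforward to compute. The following sketch uses `statistics.NormalDist` from the Python standard library for $\Phi^{-1}$; the function name `de_wet_venter_scores` is ours.

```python
from statistics import NormalDist


def de_wet_venter_scores(n):
    """Return [H_1n, ..., H_nn] with H_in = Phi^{-1}(i / (n + 1))."""
    standard_normal = NormalDist()  # mean 0, standard deviation 1
    return [standard_normal.inv_cdf(i / (n + 1)) for i in range(1, n + 1)]


scores = de_wet_venter_scores(5)
```

The scores are antisymmetric about the middle index, mirroring the symmetry of the standard normal distribution.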

In addition, De Wet and Venter (1972) establish the following asymptotic result:

$$ 2n\{1 - r_n(X,H)\} - a_n \to \sum_{i=3}^{\infty} (Y_i^2 - 1)\, i^{-1} , $$

where X represents an ordered random sample from a normal distribution, $a_n$ is the constant

$$ a_n = E[2n\{1 - r_n(X,H)\}] $$

and the $Y_i$ are independent N(0,1) variates. They provide tables for this asymptotic distribution, and also values of $a_n$.

It is well-known that $m_{in} \approx H_{in}$ except for extreme values and De Wet and Venter imply that the asymptotic distribution of $r_n^2(X,m)$ is the same as that of $r_n^2(X,H)$. Since W and W' have become well established as tests of normality, and have good power properties, it would be valuable to have rigorous asymptotic theory.

In this article we first give, in Theorem 1, approximations for $m_{in}$, together with expressions for the error in the approximations. These should have some independent interest, but in Theorem 2 they are used to show that $r_n^2(X,m)$ does indeed have the same asymptotic properties as $r_n^2(X,H)$. The theorems are given in Section 2, and the proofs appear in Sections 3 and 4.

1.3  Notation 

In addition to the notation established above, it will be convenient to list further definitions which will be required in the later sections.

From now on, X will refer to an ordered random sample from $N(\mu,\sigma^2)$, and vector Z, with components $Z_{in}$, i = 1,...,n, will be a similar vector from N(0,1). Then let $U_i$, i = 1,...,n, be defined by $U_i = \Phi(Z_{in})$; vector U with components $U_i$, i = 1,...,n, will then be a vector of order statistics from the uniform distribution with limits 0,1, written U(0,1); the dropping of the second subscript n in component $U_i$ is done to facilitate the printing of the algebraic calculations in Sections 3 and 4, and related quantities will likewise be simplified in notation. Thus we define $V_i = -\log(U_i)$; note that $V_1 > V_2 > \cdots > V_n$ are order statistics from a standard exponential distribution. Let Ψ be the function defined by $Z_{in} = \Phi^{-1}(U_i) = \Phi^{-1}\{\exp(-V_i)\} = \Psi(V_i)$, i = 1,...,n.

Set $Y(u) = \Psi''(\log u^{-1})$, and let $s_i = E(V_i) = \sum_{v=i}^{n} (1/v)$; also define $p_i^0 = e^{-s_i}$.

The  end  of  a  proof  will  be  marked  by  || . 


2.  TWO  THEOREMS 


2.1  An  Approximation  for  $m_{in}$.

Let $Z_{in}$, $m_{in}$, $V_i$ and $s_i$, for i = 1,...,n, be as defined in Section 1.3. We have

THEOREM 1: The mean of $Z_{in}$ is given by

$$ m_{in} = \Phi^{-1}\{\exp(-s_i)\} + R_{in}, \qquad 1 \le i \le [(n+1)/2] , $$

where

$$ |R_{in}| \le C\, i^{-1}\{\log(n/i)\}^{-3/2} $$

and C is a constant independent of i and n.


REMARKS. (1) $m_{in} = -m_{jn}$, where j = n+1-i, so the theorem covers $1 \le i \le n$.

(2) For i far enough away from 1, $e^{-s_i} \approx i/(n+1)$; the theorem then gives $m_{in} \approx \Phi^{-1}\{i/(n+1)\} = H_{in}$ above, a familiar approximation for $m_{in}$. Blom (1958) introduced the idea of using exponential order statistics, although he expanded $Z_{in}$ about $\log\{i/(n+1)\}$ and restricted his attention to the case of i fixed as $n \to \infty$.

(3) A crude expression for the error term when $H_{in}$ is used to approximate $m_{in}$ was stated by David and Johnson (1956) but this requires i/n to be bounded away from both 0 and 1. Lemma 6 below shows that when $H_{in}$ is used to approximate $m_{in}$ the error term is of order $i^{-1}\{\log(n/i)\}^{-1/2}$, $1 \le i \le [(n+1)/2]$. Thus $\Phi^{-1}(\exp(-s_i))$ provides a better approximation, at least asymptotically.
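To make the two approximations concrete, the sketch below tabulates $\Phi^{-1}\{\exp(-s_i)\}$ (the leading term in Theorem 1) next to $H_{in} = \Phi^{-1}\{i/(n+1)\}$ for the lower half of the sample. The function names are ours, and the exact means $m_{in}$ are not computed here.

```python
import math
from statistics import NormalDist


def exponential_means(n):
    """Return [s_1, ..., s_n] with s_i = sum_{v=i}^{n} 1/v."""
    partial = [0.0] * (n + 2)  # partial[n + 1] = 0 is the empty sum
    for v in range(n, 0, -1):
        partial[v] = partial[v + 1] + 1.0 / v
    return partial[1:n + 1]


def lower_half_scores(n):
    """Return rows (i, Phi^{-1}(exp(-s_i)), H_in) for 1 <= i <= [(n + 1) / 2]."""
    standard_normal = NormalDist()
    s = exponential_means(n)
    rows = []
    for i in range(1, (n + 1) // 2 + 1):
        p0 = math.exp(-s[i - 1])  # p_i^0 = exp(-s_i)
        rows.append((i, standard_normal.inv_cdf(p0), standard_normal.inv_cdf(i / (n + 1))))
    return rows


rows = lower_half_scores(20)
```

Because $s_i > \log\{(n+1)/i\}$, the value $p_i^0$ lies below $i/(n+1)$, so the first approximation is always more extreme (more negative) than $H_{in}$ in the lower half; the remaining indices follow from the symmetry in Remark (1).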

2.2  The next theorem establishes the equivalence of the asymptotic distributions for $r_n(Z,m)$ and $r_n(Z,H)$.

THEOREM 2: With $r_n(Z,m)$ and $r_n(Z,H)$ defined as in Section 1.1,

$$ n\{r_n(Z,m) - r_n(Z,H)\} \to 0 $$

in probability.

Since, under $H_0$, $Z_{in} = (X_{in}-\mu)/\sigma$, it follows that $r_n(X,m) = r_n(Z,m)$ and $r_n(X,H) = r_n(Z,H)$. Thus Theorem 2 asserts that the Shapiro-Francia and the De Wet-Venter correlation statistics have the same null asymptotic distributions.
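A small simulation can give a feel for the statement of Theorem 2. The sketch below is illustrative only: since the exact $m_{in}$ are not available in closed form, it uses the leading term of Theorem 1, $\Phi^{-1}\{\exp(-s_i)\}$ extended by symmetry, as a stand-in for m (an assumption of ours), and the function names and seed are hypothetical.

```python
import math
import random
from statistics import NormalDist


def correlation(xs, ts):
    """Sample correlation between two equal-length sequences."""
    n = len(xs)
    x_bar, t_bar = sum(xs) / n, sum(ts) / n
    num = sum((x - x_bar) * (t - t_bar) for x, t in zip(xs, ts))
    den = math.sqrt(sum((x - x_bar) ** 2 for x in xs) * sum((t - t_bar) ** 2 for t in ts))
    return num / den


def h_scores(n):
    """H_in = Phi^{-1}(i / (n + 1))."""
    std = NormalDist()
    return [std.inv_cdf(i / (n + 1)) for i in range(1, n + 1)]


def proxy_m_scores(n):
    """Stand-in for m_in: Phi^{-1}(exp(-s_i)) for the lower half, mirrored for the upper half."""
    std = NormalDist()
    partial = [0.0] * (n + 2)
    for v in range(n, 0, -1):
        partial[v] = partial[v + 1] + 1.0 / v  # partial[i] = s_i
    scores = [0.0] * n  # the middle score stays 0 when n is odd
    for i in range(1, n // 2 + 1):
        value = std.inv_cdf(math.exp(-partial[i]))
        scores[i - 1] = value
        scores[n - i] = -value
    return scores


def compare(n, seed=0):
    """Return (r with proxy m, r with H, n times their difference) for one simulated N(0,1) sample."""
    rng = random.Random(seed)
    z = sorted(rng.gauss(0.0, 1.0) for _ in range(n))
    r_m = correlation(z, proxy_m_scores(n))
    r_h = correlation(z, h_scores(n))
    return r_m, r_h, n * (r_m - r_h)


r_m, r_h, scaled_difference = compare(200, seed=1)
```

Repeating `compare` for increasing n and several seeds shows how the scaled difference behaves; this is an empirical illustration under the stand-in assumption, not a substitute for the proof.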


In  the  next  two  sections  we  give  the  proofs  of  Theorems  1  and  2. 


3.  PROOF  OF  THEOREM  1 


3.1  Throughout the section we suppose $1 \le i \le [(n+1)/2]$. In this paragraph the first steps in the proof are given. These steps motivate a series of lemmas which are required to complete the proof.

To begin, we expand $Z_{in}$ about $s_i = \sum_{v=i}^{n} 1/v$. This gives, using the notation $Z_{in} = \Phi^{-1}(e^{-V_i}) = \Psi(V_i)$,

$$ Z_{in} = \Psi(s_i) + (V_i - s_i)\Psi'(s_i) + \tfrac12 (V_i - s_i)^2 \Psi''(\theta_i) $$

where $\theta_i$ lies between $s_i$ and $V_i$. Taking expectations gives

$$ m_{in} = E(Z_{in}) = \Psi(s_i) + \tfrac12 E(V_i - s_i)^2 \Psi''(\theta_i) . $$

We need to evaluate $E(V_i - s_i)^2\Psi''(\theta_i)$. Recall that $U_i = \exp(-V_i)$, and let $g_{in}(u)$ be the density of $U_i$; we have

$$ g_{in}(u) = n\binom{n-1}{i-1} u^{i-1}(1-u)^{n-i}, \qquad 0 < u < 1 . $$


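Also with $p_i^0 = \exp(-s_i)$, we define $h_{in}$ by:

$$ h_{in} = \begin{cases} p_i^0[1+\lambda(i-1)^{-1/2}(\log n)^{1/2}] & \text{for } i > 1+\lambda^2\log n , \\ p_i^0(1+\lambda^3 i^{-1}\log n) & \text{for } 1 \le i \le 1+\lambda^2\log n , \end{cases} $$

with λ = 10. Then

$$ E(V_i - s_i)^2 \Psi''(\theta_i) = \int_0^1 y(u)\,du = I_1 + I_2 , $$

where $y(u) = (\log u^{-1} - s_i)^2 \Psi''(\theta_i) g_{in}(u)$, and

$$ (2) \qquad I_1 = \int_0^{h_{in}} y(u)\,du \quad \text{and} \quad I_2 = \int_{h_{in}}^{1} y(u)\,du . $$

The integral must now be evaluated by evaluating $I_1$ and $I_2$. To do this, a series of lemmas is needed. A key result in the proof of Theorem 1 is Lemma 1.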


LEMMA 1. The function $Y(u) = \Psi''(\log u^{-1})$ is monotonic increasing and positive for $0 \le u \le 0.5$; also

$$ (3) \qquad Y(u) < (|\Phi^{-1}(u)|)^{-3} \ \text{for } \Phi^{-1}(u) < 0 ; \qquad Y(u) < 16 \ \text{for } \Phi^{-1}(u) < 1 . $$

COROLLARY:

$Y(u) \le 32(1+|\Phi^{-1}(u)|^3)^{-1}$ for $\Phi^{-1}(u) \le 1$, that is, for $0 \le u \le \Phi(1)$.

PROOF OF LEMMA 1. It is easily shown that $Y(u) = A(u)B(u)$ where $A(u) = u\{\phi(\Phi^{-1}(u))\}^{-1}$ and $B(u) = 1+u\Phi^{-1}(u)[\phi\{\Phi^{-1}(u)\}]^{-1}$. We therefore prove that Y(u) is monotonic by proving that both A(u) and B(u) are monotonic.

Monotonicity of A(u). Let $x = \Phi^{-1}(u)$ and $A(u) = A_1(x)$. Since du/dx > 0, A(u) is monotonic increasing if $A_1'(x) > 0$. We have $A_1(x) = \Phi(x)/\phi(x)$; thus $A_1'(x) = C(x)/\phi(x)$, where $C(x) = \phi(x)+x\Phi(x)$. It is easily shown that $\lim_{x\to-\infty} C(x) = 0$; also $C'(x) = \Phi(x)$, using the fact that $\phi'(x) = -x\phi(x)$; thus C'(x) > 0, and C(x) is positive and monotonic increasing for all x. Hence $A_1'(x)$ is positive, so A(u) is monotonic increasing for all x.

Monotonicity of B(u). Let $B(u) = B_1(x)$. Define the function $D(x) = (1+x^2)\Phi(x)+x\phi(x)$. We have $D'(x) = 2C(x) > 0$; also $\lim_{x\to-\infty} D(x) = 0$, so D(x) is positive and monotonic increasing for all x. Now $B_1(x) = 1+x\Phi(x)/\phi(x) = C(x)/\phi(x)$; then $B_1'(x) = \{C'(x)+xC(x)\}/\phi(x) = \{(1+x^2)\Phi(x)+x\phi(x)\}/\phi(x) = D(x)/\phi(x)$. Thus $B_1'(x) > 0$; hence B'(u) > 0 and so B(u) is monotonically increasing for all x.

It is well-known (see, for example, Renyi (1970, p. 164)) that for x < 0,

$$ (4) \qquad 1 - x^{-2} < |x|\Phi(x)/\phi(x) < 1 ; $$

thus, for x < 0, $A_1(x) = \Phi(x)/\phi(x) < 1/|x|$, and $B_1(x) = 1+x\Phi(x)/\phi(x) < x^{-2}$; hence $Y(u) = B_1(x)A_1(x) < 1/|x|^3 = |\Phi^{-1}(u)|^{-3}$ for u < 0.5, that is, for x < 0. For $0.5 < u < \Phi(1)$, that is, for 0 < x < 1, Y(u) is increasing, so for the second inequality we evaluate Y(u) at $u = \Phi(1)$; the value is approximately 15.6. This completes the proof of Lemma 1.
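The bounds in Lemma 1 can be checked numerically from the factorisation $Y(u) = A(u)B(u)$ used in the proof, with $A = \Phi(x)/\phi(x)$ and $B = 1 + x\Phi(x)/\phi(x)$ at $x = \Phi^{-1}(u)$. The sketch below is a sanity check on a grid of our choosing, not part of the argument; the function name `y_function` is ours.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist()


def y_function(u):
    """Evaluate Y(u) = A(u) * B(u) with x = Phi^{-1}(u)."""
    x = STANDARD_NORMAL.inv_cdf(u)
    ratio = u / STANDARD_NORMAL.pdf(x)  # A(u) = Phi(x) / phi(x)
    return ratio * (1.0 + x * ratio)    # B(u) = 1 + x * Phi(x) / phi(x)


upper_end = STANDARD_NORMAL.cdf(1.0)     # u = Phi(1)
value_at_upper_end = y_function(upper_end)

# Grid of u values in (0, Phi(1)) used for the checks below.
grid = [k / 100 for k in range(1, 85)]
values = [y_function(u) for u in grid]
```

On this grid the values increase with u and stay below 16, in line with the value of roughly 15.6 quoted at $u = \Phi(1)$.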


LEMMA 2: There is a K, independent of i, such that for all n > K

$$ h_{in} \le \Phi(1), \qquad 1 \le i \le [\tfrac12(n+1)] . $$


PROOF  OF  LEMMA  2.  We  have 


l-l  .  m-1  -  ,  n-2 

I  v"1  <  x_1dx  -  log(jIr)  <  l  v"1  ; 

1  i-1  1  v-i-1 


e‘1/n(S)  <  pi  "  6  1  <^l<n 


From the definition of $h_{in}$, the lemma follows. ||
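The role of the threshold K in Lemma 2 can be seen numerically. The sketch below evaluates $h_{in}$ over $1 \le i \le [(n+1)/2]$, assuming the definition of Section 3.1 reads $h_{in} = p_i^0[1+\lambda\{\log(n)/(i-1)\}^{1/2}]$ for $i > 1+\lambda^2\log n$ and $h_{in} = p_i^0(1+\lambda^3 i^{-1}\log n)$ otherwise, with λ = 10; the function name `h_values` and the two sample sizes are ours.

```python
import math
from statistics import NormalDist

LAMBDA = 10.0


def h_values(n, lam=LAMBDA):
    """Return [h_1n, ..., h_kn] for k = [(n + 1) / 2], using p_i^0 = exp(-s_i)."""
    partial = [0.0] * (n + 2)
    for v in range(n, 0, -1):
        partial[v] = partial[v + 1] + 1.0 / v  # partial[i] = s_i = sum_{v=i}^{n} 1/v
    log_n = math.log(n)
    values = []
    for i in range(1, (n + 1) // 2 + 1):
        p0 = math.exp(-partial[i])
        if i > 1 + lam ** 2 * log_n:
            h = p0 * (1.0 + lam * math.sqrt(log_n / (i - 1)))
        else:
            h = p0 * (1.0 + lam ** 3 * log_n / i)
        values.append(h)
    return values


phi_of_one = NormalDist().cdf(1.0)
largest_h_small_n = max(h_values(1000))
largest_h_large_n = max(h_values(100000))
```

With λ = 10 the bound $h_{in} \le \Phi(1)$ fails for moderate n such as 1000 but holds for n = 100000, which is consistent with the lemma being a statement about all sufficiently large n.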


LEMMA 3: For a constant $c_0$ ($0 < c_0 < 1$), there is a constant $\gamma(c_0)$ such that for $0 < u < \gamma(c_0)$,

$$ -\{-\log(2\pi u^2)\}^{1/2} < \Phi^{-1}(u) < -\{-c_0\log(2\pi u^2)\}^{1/2} . $$

PROOF OF LEMMA 3. By (4) we have

$$ \phi(z)|z|^{-1}(1 - z^{-2}) < \Phi(z) < \phi(z)|z|^{-1} ; $$

set $v = -\log(2\pi u^2)$ and $z = -(c_0 v)^{1/2}$; then

$$ u^{-1}\Phi\{-(c_0 v)^{1/2}\} > \frac{\phi\{-(c_0 v)^{1/2}\}}{u (c_0 v)^{1/2}} \{1 - (c_0 v)^{-1}\} = \frac{1 - (c_0 v)^{-1}}{(u\sqrt{2\pi})^{1-c_0}(c_0 v)^{1/2}} = h(u,c_0), \ \text{say} . $$

Now $h(u,c_0) \to \infty$ as $u \downarrow 0$ so there is a constant $\gamma_0(c_0)$ such that $h(u,c_0) > 1$ for $0 < u < \gamma_0(c_0)$. Similarly

$$ u^{-1}\Phi(-v^{1/2}) < u^{-1}\phi(-v^{1/2})/v^{1/2} < 1, \quad \text{provided } u < (2\pi e)^{-1/2} . $$

Thus, if $\gamma(c_0) = \min\{\gamma_0(c_0), (2\pi e)^{-1/2}\}$, we have for $0 < u < \gamma(c_0)$

$$ u^{-1}\Phi(-v^{1/2}) < 1 \quad \text{and} \quad u^{-1}\Phi\{-(c_0 v)^{1/2}\} > 1 . $$

Lemma 3 follows at once.
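The two-sided bound of Lemma 3 on the normal quantile can be checked at a few small values of u. The sketch below takes $c_0 = 0.5$ and a handful of test points, all of which are our choices; it reads the bound as $-\{-\log(2\pi u^2)\}^{1/2} < \Phi^{-1}(u) < -\{-c_0\log(2\pi u^2)\}^{1/2}$.

```python
import math
from statistics import NormalDist


def lemma3_bounds(u, c0=0.5):
    """Return (lower bound, Phi^{-1}(u), upper bound) for the quantile bound of Lemma 3."""
    v = -math.log(2.0 * math.pi * u * u)  # v = -log(2 * pi * u^2), positive for small u
    lower = -math.sqrt(v)
    upper = -math.sqrt(c0 * v)
    return lower, NormalDist().inv_cdf(u), upper


# Hypothetical test points, all below (2 * pi * e)^{-1/2}, roughly 0.242.
test_points = [1e-8, 1e-6, 1e-4, 1e-2, 0.1]
checks = [lemma3_bounds(u) for u in test_points]
```

For each test point the quantile falls strictly between the two bounds, showing that $|\Phi^{-1}(u)|$ grows like $\{\log(1/u)\}^{1/2}$ as u decreases.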


LEMMA 4: With $p_i^0$ and $h_{in}$ defined as in Section 3.1, there is a K such that for n > K

$$ |\Phi^{-1}(h_{in})| > C|\Phi^{-1}(p_i^0)|, \qquad 1 \le i \le [\tfrac12(n+1)] , $$

where C is independent of i and n.

PROOF OF LEMMA 4. By Lemma 3, when $h_{in} < \gamma(c_0)$ with $c_0$ fixed ($0 < c_0 < 1$) but otherwise arbitrary (for example, $c_0$ can be 0.5),


T 


2  \ \l/2 


I*  Chln)|  >  {-c0log(2irh^n)} 


cJ/2[-21og{pJ(2X3)i_1(log  n)/2rr}]1/2,l  <  i  <  l+X2log n 


cJ;/2(-21og[p?{l+X(i-l)"1/2(log  n)1/2}/2?])1/2,  l+X2logn  <i  <  - 
01  L 


Using  (5)  we  obtain 


yWjItl - L°-S{  ■-2o--)lQg rl - TTT -]1/2.  1<  i<  l+X2logn 


Cq  z  |  <TA  (p")|[  1 - gf — htf 

0  1  log{n(l+XZlog  n)  1(2tt)  1/2} 


!*"  (hln)l  > 


cO/2!$~1(pi)l(1“log2/log4)X/Z’  1+xZl°8n<i  1  n(327r) 


1/2  ,n  2. 


and  so 


$‘A(hin)|  >  c1|$"1(p°)|  for  1  <  1  <  n(32Ti)_i/i  and  hin  <  y(cq) 


When  n  >  K,  by  Lemma  2,  h^n  <_  $>(1),  and  by  (5) 


n(327r)_1/2  <  1  <  [|(n+l)]  =^> ^(32tt)“1/ 2  <  pj  <  -| 


=>  h,  >  — — 
in  8/2? 


For  y(cn)  <  h  <  $(1)»  there  is  a  c.,  independent  of  1  and  n, 
u  —  in  —  / 


0  „  1 


such  that  C2  <  p^  £  -j.  From  these  results  taken  together,  there  must 
be  a  Cj  and  c^,  both  independent  of  i  and  n,  such  that  with 


n  >  K, 




K1(hln)l  > 


C3I$  <P±>  I  for  (32ir)  n  <  i<  [y(n-H)J  , 


c4|*“1(pj)|  for  1  <  i  <  (32ir)-1/2n,  y(cq)  <  h± 


This  completes  the  proof  of  Lemma  4. 


LEMMA 5: For $1 \le i \le [\tfrac12(n+1)]$ and k a fixed positive integer

$$ \int_{\Phi(1)}^{1} (1-u)^{-k} g_{in}(u)\,du = o(n^{-m}) , $$

where m is an arbitrary positive real number.

PROOF OF LEMMA 5. Let

$$ f_{in}(u) = \frac{(n-1)^i}{(i-1)!}\, u^{i-1} e^{-(n-1)u}, \qquad u \ge 0 . $$

Lemma 3 of Stigler (1969, p. 774) gives the result that for any ε > 0, there is an M depending only on ε such that $g_{in}(u) \le M f_{in}(u)$ for all $u \ge 0$ and $i \le (1-\epsilon)n$. Therefore

f1  ,,  x-k  ,  x,  n(n-l) • • • (n-k+1)  f1  .  ,  x , 

4(1)  8ln  '  (n-i)---(n-l-Id-l)  J4(1)  8i(n-k><u)du 

<  3^1  f  ,  . ,  (u)du  for  n  >  4k 

Jam  un_K' 


r  1  .„x,  f1  (n-k-l)1  i-1  -u 

C  exp{-  -  n$(l)}  -.-xf  u  e 

1  J  <6  riY 


-1  -u(n-k-l)/2. 


<  C  exp{-  j  n$(l)}2i 


,i  „  { (n+l)log2}/2  „  .4(n+l)  -n$(l)/2 


This completes the proof of Lemma 5. ||


3.3  We continue with the proof of Theorem 1. We assume n > max(K,K'). From (2) in Section 3.1 we have

$$ I_1 = \int_0^{h_{in}} (\log u^{-1} - s_i)^2 \Psi''(\theta_i) g_{in}(u)\,du < C[1+|\Phi^{-1}(p_i^0)|^3]^{-1} \sum_{v=i}^{n} v^{-2} , $$

using Lemmas 1 and 4, and the fact that $E(V_i - s_i)^2 = \sum_{v=i}^{n} v^{-2}$. Also $I_2 = I_{21} + I_{22}$ where

$$ I_{21} = \int_{h_{in}}^{\Phi(1)} (\log u^{-1} - s_i)^2 \Psi''(\theta_i) g_{in}(u)\,du $$

and

$$ I_{22} = \int_{\Phi(1)}^{1} (\log u^{-1} - s_i)^2 \Psi''(\theta_i) g_{in}(u)\,du . $$

By Lemma 1, we can write

$$ (6) \qquad I_{21} < 16 s_i^2 \int_{h_{in}}^{\Phi(1)} g_{in}(u)\,du , $$

using the fact that $\log u^{-1} < s_i$ throughout the range of the integral.
To  evaluate  the  integral  in  (6)  we  use  an  argument  given  in  Lemma  4  of 
Stigler  (1969).  (The  lemma  as  stated,  however,  has  a  condition  missing. 
Using  the  notation  of  Stigler  (1969),  let  (i-1)  <  X2log  n,  h(u)  *  1,  k-0. 


and  consider  A', 
n 


r 

Jv' 


i-1  -(n-l)Y'  . 

f.,_(u)du  -  l  e  n((n-l)y^):J/j ! 


“in' 


n 


J-0 


>  “^n-1^n  -X( i-1) log  n) 


1/2 


-nu 


Suppose  i  *  log(log  n);  the  right  hand  side  will  not  then  be  o(n  ) 
for  any  m  >  0.  The  condition  b^/log  n  -►  00  is  also  necessary.  It 
is  to  avoid  this  difficulty  that  we  define  h^n  over  two  regions 
for  i.)  By  (5),  for  i  >  1+A  log  n,  we  have 


hin  *  P°U+*{l°g(n)/<i-l>>1/2] 


>  e"1/nll+X{log(n)/(i-l)>1/2] 


>  ^1~1)  [l+X{log(n)/(i-l))1/2]  «  h*n  for  n  >  K"  , 

where K'' is an appropriate constant.

For $1 \le i \le 1+\lambda^2\log(n)$ we have

$$ h_{in} = p_i^0(1+\lambda^3 i^{-1}\log n) > .45\lambda^3(n-1)^{-1}\log n \quad \text{for } n > K''' , $$

where K''' is an appropriate constant.

using  the  fact  that  >  (i  -  j)/ (n  +y) .  Thus  for  i  >  1  +  X2log  n, 
noting  that  X  ■  10  and  fin(u)  is  decreasing  for  u  >  (i-l)/(n-l), 
we  find 
r*<D


in 


8in^du 


<«[' 


.  fln(o)iu 
‘in 


<  Cn[l+X{log(n)/(i-l)}1/2]i'1e‘X{(i‘1)log  n} 


1/2 


-  0(n-2)  , 


since  (l+a/r)r  £  exp[a-{a2/ (12r) }] ,  0  £  a  £  r. 


2  3-1 

For  1  £  i  £  1+A  log  n,  we  have,  using  wfl  *  0.451  (n-1)  log  n. 


f$(l)  r*U) 

gln(u)du  <  M  j  fln(»)<lu 

“in  h- 


C*(D 
in 

£  M  I  f4_(u)du 

*  u 


in' 


£  M{exp(-.225A3log  n)}2i  J  ■  ui-1e"u 


-1  -u(n-l)/2 


du 


Hence 


£  M  exp[-{.225A  -(l+X2)log  2)log  n] 


o(n“2)  . 


-1  2 
I21  “  °^n  log 


*  -Qi 

and,  writing  u  for  e 


9 
u  (1— u  )


$(l)  4.{$_1(u)*} 


r /,  .  U  (1-U  )$  (U  ,.-2  , 

[(1-u  )  + - - — - - ]  (1-u)  g  (u)du 

^(u*)}  in 


4>{$" 


0  *  *  -1  -1 
since  <  u  <  u  and  hence  (1-u  )  <  (1-u) 

Note  that  (4)  holds  .for  x  >  0  with  $(x)  replaced  by  l-$(x). 

We  apply  (4)  with  x  ■  $(u  );  this  allows  us  to  bound  the  term  in 

*  *  1  *  1 
1^2  involving  u  irrespective  of  whether  u  <  or  u  >2-. 

Thus  by  Lemma  5,  we  have 


2  -3 

Z22  —  C  si  n 

Finally  by  equation  (5)  and  Lemma  3,  we  obtain 

1*"%°)!  >  K1^)!  >  C(log(f)}1/2,  1  <  i  <  t|(n4-l)l  . 


This completes the proof of Theorem 1. ||


4.  PROOF  OF  THEOREM  2 


and

$$ \Phi^{-1}(p_i^0) - \Phi^{-1}(p_i) = -[s_i - \log\{(n+1)i^{-1}\}]\, p_i/\phi\{\Phi^{-1}(p_i)\} + \tfrac12 [s_i - \log\{(n+1)i^{-1}\}]^2 \Psi''(\theta_i^*) $$

where

$$ \log\{(n+1)/i\} < \theta_i^* < s_i . $$


XI 


$$ \|m-H\|^2 = 2\sum_{i=1}^{[(n+1)/2]} (m_{in}-H_{in})^2 \le c' \sum_{i=1}^{[(n+1)/2]} i^{-2}\{\log(n/i)\}^{-1} < c'' n^{-1} \int_2^n (\log x)^{-1}\,dx \le c(\log n)^{-1} , $$

where c, c', c'' are constants independent of n.


LEMMA 8: The norm of vector H satisfies

$$ n - 5\log n \le \|H\|^2 < n+1 \quad \text{for } n \ge 24 . $$

PROOF OF LEMMA 8: For the second inequality

$$ \|H\|^2 \le (n+1)\int_0^1 [\Phi^{-1}(u)]^2\,du = n+1 . $$

Jo 


For the first inequality

$$ \|H\|^2 = 2\sum_{i=1}^{[(n+1)/2]} [\Phi^{-1}\{i/(n+1)\}]^2 \ge 2(n+1)\int_{1/(n+1)}^{1/2} [\Phi^{-1}(u)]^2\,du $$

$$ = (n+1)\Bigl[1 - 2\int_{-\infty}^{\Phi^{-1}(1/(n+1))} u^2\phi(u)\,du\Bigr] $$

$$ = (n+1)\bigl[1 - 2|\Phi^{-1}\{1/(n+1)\}|\,\phi[\Phi^{-1}\{1/(n+1)\}] - 2(n+1)^{-1}\bigr] $$


>  n-S^Ml/Cn+lHl^tt-rtlAiHd)}]2-!)'1  by  (4) 


$\ge n - 5\log n$ by Lemma 3 and the fact that $|\Phi^{-1}\{1/(n+1)\}| > 1.73$ for $n \ge 24$. ||
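The inequality of Lemma 8 is easy to check numerically for particular n. The sketch below reads the lemma as a statement about the squared norm $\|H\|^2 = \sum_i [\Phi^{-1}\{i/(n+1)\}]^2$, which is how the proof treats it; the function names and the sample sizes tested are ours.

```python
import math
from statistics import NormalDist


def squared_norm_h(n):
    """Return ||H||^2, the sum of squared scores H_in = Phi^{-1}(i / (n + 1))."""
    standard_normal = NormalDist()
    return sum(standard_normal.inv_cdf(i / (n + 1)) ** 2 for i in range(1, n + 1))


def lemma8_holds(n):
    """Check n - 5 log n <= ||H||^2 < n + 1 for one value of n."""
    value = squared_norm_h(n)
    return n - 5.0 * math.log(n) <= value < n + 1


results = {n: lemma8_holds(n) for n in (24, 100, 1000)}
```

The check passes for each of these sample sizes, and the gap between $\|H\|^2$ and n grows only logarithmically, as the lower bound suggests.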

LEMMA 9: The normalised vectors for m and H satisfy


^  -  ,r-ll  <  ^llsir1  ||m— h[| 

l»ll  l|H|| 


COROLLARY: 


0  <  || m ||  || H||  -  l  »lnmin  <  1 1 m— H 1 1  2  ||m||  /||h||. 


PROOF OF LEMMA 9 AND COROLLARY: This is the same as that given by Sarkadi (1975, p. 447) for a similar result. It depends on the fact that if $\{a_i\}$ and $\{b_i\}$ are both increasing sequences of n real numbers, then $\sum_1^n a_i b_i \ge 0$ if either $\sum_1^n a_i = 0$ or $\sum_1^n b_i = 0$. In particular


Z“  a  ^  >  0 


m  H 

FH"PT 


Then  in  the  sketch,  with  w  representing  the 
angle  between  m  and  H  , 

iipn  -  m11  cos(w/2) 

H 


<_  "length  of  any  line  from  top  of  to  a  point  on  m" 

H  m_  ii  n ,, 1 1  — 1 

H 


-  "IIH 


H 


m-H 
where c is a constant independent of i and n.

PROOF OF LEMMA 11: The formal proof of this Lemma follows the same steps as those used to establish Theorem 1. In this case we expand only to the first term

$$ Z_{in} = \Psi(s_i) + (V_i - s_i)\Psi'(\theta_i) , $$

where $\Psi'(v) = -e^{-v}/\phi(\Phi^{-1}(e^{-v}))$ and $\theta_i$ is between $V_i$ and $s_i$.

This yields

$$ E|Z_{in} - \Phi^{-1}(p_i^0)| = E|V_i - s_i|\,|\Psi'(\theta_i)| . $$

By Lemma 1, $\Psi'(v) = -e^{-v}\{\phi(\Phi^{-1}(e^{-v}))\}^{-1}$ is increasing in v and is negative. Hence, $|\Psi'(\theta)|$ is decreasing as θ increases and by (4)


If '(log  if1)!  < 


♦^(u)!”1  for  $_J-(u)  <  -1 


-1, 


.-1 


for  | $  (u) I  <1  . 


Therefore 

|f' (log  u  1) |  <  8{l+|$  1(u)[},  $  X(u)  £1,  that  is,  for  0  £  u  £  $(1) 


Now 


e{|v  -s  ||r(0  )|}  - 
11  1  o 


f in 

f$(l) 

+ 

+ 

Jo  J 

h._ 

(log  u_1-si| |r (0i) |gin(u)du  . 


in 


$(1) 


These  integrals  can  be  evaluated  exactly  as  before,  yielding  the  upper 


bound 


So 

n-1  ||m[| 2  <  n"1  ?  E{Z^}  -  1  . 

Also 


l-6n  Hog  n-c(n  log  n)  ^-c’n  Hog  n  , 


by Lemmas 7, 8, and 10. ||


LEMMA  13:  The  norms  of  vectors  m  and  H  satisfy 


1/2 


-1 


M  -||H| 


<_  cn  1log  n  , 


where  c  is  a  constant  independent  of  n. 


PROOF  OF  LEMMA  13:  We  have 


m  -  H 


y  (m.  -H.  )2  +  2  f  H.  (m.  -H.  ) 
^  in  in  L  ^  •»" 


in'  in  in' 


/(  m 


,  -1/2, 

<  cn  log  n 


log  n  by  Lemmas  7,  8,  10  and  12. 


4.2.  We  can  now  turn  to  the  major  proof. 


PROOF  OF  THEOREM  2:  We  have 


n 


„<rn(Z>»)-rn«,H)}  -  S'1  l  ZJvC'V;1! 


where 


_  2 


M2  -  n"1  l  m2  ,  K2  -  n"1  £  H2  S2  =  n_1  £  (Z  -Z) 

n  |  in’  n  J  in’  n  ^  in 


and 


—  i  “ 

Z  -  n  1  l  Z.  . 
L  in 


i-1 


As  S 


2  a.s. 


1,  n{rn(Z,m)-rn(Z,H) }  - *■  0  if  and  only  if 
IMIhII)

II 


* 


A  -  7  Z,  (m.  M*1-H.  if1)  - ►  0 

n  .*•-  in  in  n  in  n 
i»l 


-1,0, 


»— 1  tt  ll 


+  V  ^  Hinmin_"°'l  11HH } 


B  +  C  +  D  ,  say  . 
n  n  n 


Using Markov's inequality, $B_n \to 0$ in probability if $E|B_n| \to 0$. By Lemmas 6, 11, 12 and 13, we have


.-1  „-l| 


[ (n+l)/2j  i  in  i  i  /o  i  /j  _i 

<  c  l  (i  log(n/i)r1/Z[ci  llogCn/i)}  i/Z+{log(n/i)}A/  n  Xlog  n] 
i-1 


$\le c(\log n)^{-1} \to 0$ as $n \to \infty$.


By  the  proof  of  Lemma  6  , 


[(®^l)/2]  i  —1/2  —1  —1/2 

C  <c  l  i  X{log(n/i)}  1/Z[c'i  1{log(n/i)} 


+  (log(n/i) }^^Zn”^log  n] 


$\le c(\log n)^{-1}$.

